Corpus: spa_web_2011_300K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings in the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              96            96             97            97            97
1000            898           968            988           992           995
10000          6357          8732           9628          9886          9941
100000        33920         68943          88820         96397         97822
1000000       71056        176112         250663        283804        292208
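
The counts in a table like the one above could be produced roughly as follows. This is a minimal sketch, not the method used to generate this report: the function name `ending_ngram_counts` is hypothetical, tokens are assumed to be whitespace-separated (so final punctuation counts as a token only if it is separated by a space), and sentences shorter than N tokens are assumed to contribute no N-gram.

```python
from typing import Dict, Iterable, List, Set, Tuple


def ending_ngram_counts(
    sentences: List[str], ks: Iterable[int], max_n: int = 5
) -> Dict[int, List[int]]:
    """Count distinct sentence-final N-grams (N = 1..max_n) in the first K sentences.

    Returns a mapping K -> [count for N=1, count for N=2, ..., count for N=max_n].
    """
    checkpoints = set(ks)
    # One set of distinct ending N-grams per value of N.
    seen: List[Set[Tuple[str, ...]]] = [set() for _ in range(max_n)]
    result: Dict[int, List[int]] = {}

    for i, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            # Assumption: a sentence shorter than n tokens has no ending n-gram.
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))
        if i in checkpoints:
            result[i] = [len(s) for s in seen]
    return result


if __name__ == "__main__":
    demo = ["I like tea .", "You like tea .", "We drink coffee ."]
    for k, row in ending_ngram_counts(demo, ks=[2, 3]).items():
        print(k, *row)
```

Values of K larger than the number of available sentences produce no row in this sketch; a real report may instead repeat the totals for the whole corpus.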


Zipf's diagram for sentence endings

[Figure: Gnuplot diagram]
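
A Zipf diagram conventionally plots frequency against rank on logarithmic axes. The points for such a plot could be derived as in the sketch below. It is only an illustration under assumptions: the helper name `zipf_points` is hypothetical, tokenization is by whitespace, and the report does not state which N or which preprocessing was used for its diagram.

```python
from collections import Counter
from typing import List, Tuple


def zipf_points(sentences: List[str], n: int = 1) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final n-grams, most frequent first."""
    counts: Counter = Counter()
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if len(tokens) >= n:  # assumption: shorter sentences are skipped
            counts[tuple(tokens[-n:])] += 1
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


if __name__ == "__main__":
    demo = ["a b .", "c b .", "d e ."]
    # One "rank frequency" pair per line, a layout a plotting tool can read directly.
    for rank, freq in zipf_points(demo, n=2):
        print(rank, freq)
```

Written to a file, this two-column output can be plotted with logarithmic scaling on both axes to obtain the usual near-linear Zipf curve.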

16978 msec needed at 2021-09-19 22:11